Markov games with frequent actions and incomplete information

نویسندگان

  • Pierre Cardaliaguet
  • Catherine Rainer
  • Dinah Rosenberg
  • Nicolas Vieille
  • P. Cardaliaguet
چکیده

We study a two-player, zero-sum, stochastic game with incomplete information on one side in which the players are allowed to play more and more frequently. The informed player observes the realization of a Markov chain on which the payoffs depend, while the non-informed player only observes his opponent’s actions. We show the existence of a limit value as the time span between two consecutive stages vanishes; this value is characterized through an auxiliary optimization problem and as the solution of an Hamilton-Jacobi equation. Key-words: Markov games, incomplete information, zero-sum games, Hamilton-Jacobi equations, repeated games. A.M.S. classification : 91A05, 91A15, 60J10

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Markov Games with Frequent Actions and Incomplete Information - The Limit Case

We study a two-player, zero-sum, stochastic game with incomplete information on one side in which the players are allowed to play more and more frequently. The informed player observes the realization of a Markov chain on which the payoffs depend, while the non-informed player only observes his opponent’s actions. We show the existence of a limit value as the time span between two consecutive s...

متن کامل

Existence of optimal strategies in Markov games with incomplete information

The existence of a value and optimal strategies is proved for the class of twoperson repeated games where the state follows a Markov chain independently of players’ actions and at the beginning of each stage only player one is informed about the state. The results apply to the case of standard signaling where players’ stage actions are observable, as well as to the model with general signals pr...

متن کامل

Utilizing Generalized Learning Automata for Finding Optimal Policies in MMDPs

Multi agent Markov decision processes (MMDPs), as the generalization of Markov decision processes to the multi agent case, have long been used for modeling multi agent system and are used as a suitable framework for Multi agent Reinforcement Learning. In this paper, a generalized learning automata based algorithm for finding optimal policies in MMDP is proposed. In the proposed algorithm, MMDP ...

متن کامل

Identification and Estimation of Incomplete Information Games with Multiple Equilibria∗

The presence of multiple equilibria in games is a big challenge for identification and estimation. Without information of the equilibrium selection, it is impossible to perform counterfactual analysis. Allowing for possibly multiple equilibria, this paper provides nonparametric identification of finite games with incomplete information. Upon observing players’ actions from cross-sectional games...

متن کامل

Semiparametric Estimation of a Dynamic Game of Incomplete Information

Recently, empirical industrial organization economists have proposed estimators for dynamic games of incomplete information. In these models, agents choose from a finite number actions and maximize expected discounted utility in a Markov perfect equilibrium. Previous econometric methods estimate the probability distribution of agents’ actions in a first stage. In a second step, a finite vector ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017